Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 52694 |
| Missing cells | 305808 |
| Missing cells (%) | 20.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 11.7 MiB |
| Average record size in memory | 232.0 B |
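The missing-cell percentage can be checked from the other figures in this table, since the total number of cells is variables × observations. A minimal sketch of that arithmetic, using only the reported counts (the variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 29
n_observations = 52694
missing_cells = 305808

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of all cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```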
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 19 |
Alerts
| Alert | Type |
|---|---|
| Registration_Date has a high cardinality: 1201 distinct values | High cardinality |
| Education_Score has a high cardinality: 211 distinct values | High cardinality |
| First_Interaction has a high cardinality: 1468 distinct values | High cardinality |
| Health_Camp_ID is highly overall correlated with Camp_Start_Date and 3 other fields | High correlation |
| Var1 is highly overall correlated with Var2 and 2 other fields | High correlation |
| Var2 is highly overall correlated with Var1 and 2 other fields | High correlation |
| Var5 is highly overall correlated with Var1 and 2 other fields | High correlation |
| Donation is highly overall correlated with outcome and 1 other field | High correlation |
| Health_Score is highly overall correlated with outcome and 1 other field | High correlation |
| Health Score is highly overall correlated with outcome and 2 other fields | High correlation |
| Number_of_stall_visited is highly overall correlated with Last_Stall_Visited_Number and 4 other fields | High correlation |
| Last_Stall_Visited_Number is highly overall correlated with Number_of_stall_visited and 4 other fields | High correlation |
| Var3 is highly overall correlated with Var1 and 3 other fields | High correlation |
| outcome is highly overall correlated with Donation and 5 other fields | High correlation |
| Online_Follower is highly overall correlated with Twitter_Shared | High correlation |
| Twitter_Shared is highly overall correlated with Online_Follower and 1 other field | High correlation |
| Facebook_Shared is highly overall correlated with Twitter_Shared | High correlation |
| Age is highly overall correlated with Var3 | High correlation |
| Camp_Start_Date is highly overall correlated with Health_Camp_ID and 5 other fields | High correlation |
| Camp_End_Date is highly overall correlated with Health_Camp_ID and 4 other fields | High correlation |
| Category1 is highly overall correlated with Health_Camp_ID and 8 other fields | High correlation |
| Category2 is highly overall correlated with Health_Camp_ID and 5 other fields | High correlation |
| Category3 is highly overall correlated with Health Score and 4 other fields | High correlation |
| Var3 is highly imbalanced (99.5%) | Imbalance |
| Var4 is highly imbalanced (94.2%) | Imbalance |
| Online_Follower is highly imbalanced (69.0%) | Imbalance |
| LinkedIn_Shared is highly imbalanced (65.2%) | Imbalance |
| Twitter_Shared is highly imbalanced (70.1%) | Imbalance |
| Facebook_Shared is highly imbalanced (69.2%) | Imbalance |
| Education_Score is highly imbalanced (83.0%) | Imbalance |
| Age is highly imbalanced (57.4%) | Imbalance |
| Category3 is highly imbalanced (95.2%) | Imbalance |
| City_Type has 23236 (44.1%) missing values | Missing |
| Employer_Category has 42095 (79.9%) missing values | Missing |
| Donation has 48337 (91.7%) missing values | Missing |
| Health_Score has 48337 (91.7%) missing values | Missing |
| Health Score has 47214 (89.6%) missing values | Missing |
| Number_of_stall_visited has 48179 (91.4%) missing values | Missing |
| Last_Stall_Visited_Number has 48179 (91.4%) missing values | Missing |
| Var1 is highly skewed (γ1 = 20.76283178) | Skewed |
| Var2 is highly skewed (γ1 = 26.92694979) | Skewed |
| Var1 has 47284 (89.7%) zeros | Zeros |
| Var2 has 50989 (96.8%) zeros | Zeros |
| Var5 has 48546 (92.1%) zeros | Zeros |
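The imbalance percentages reported for the categorical variables are consistent with one minus the normalised Shannon entropy of the class shares. That reading is an assumption checked against the reported numbers, not a statement of the tool's implementation, and `imbalance_score` is a hypothetical helper name. A minimal sketch:

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    # Zero-count classes contribute nothing to the entropy.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Class counts taken from the Var3 and Online_Follower value tables.
print(f"Var3: {imbalance_score([52672, 22]):.1%}")
print(f"Online_Follower: {imbalance_score([49762, 2932]):.1%}")
```

A score near 100% means almost all rows fall in one class; a perfectly balanced variable scores 0%.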
Reproduction
| Analysis started | 2023-01-26 08:06:14.152718 |
|---|---|
| Analysis finished | 2023-01-26 08:06:41.096093 |
| Duration | 26.94 seconds |
| Software version | pandas-profiling v3.6.3 |
| Download configuration | config.json |
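The reported duration is simply the difference between the two timestamps. A short sketch of that check; the format string is an assumption matching the timestamps as printed:

```python
from datetime import datetime

# Format matching the timestamps shown in the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-01-26 08:06:14.152718", FMT)
finished = datetime.strptime("2023-01-26 08:06:41.096093", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```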
Patient_ID
Real number (ℝ)
| Distinct | 24632 |
|---|---|
| Distinct (%) | 46.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 507203.62 |
| Minimum | 485679 |
|---|---|
| Maximum | 528657 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 485679 |
|---|---|
| 5-th percentile | 487795 |
| Q1 | 496444.25 |
| median | 507241 |
| Q3 | 517919 |
| 95-th percentile | 526526.7 |
| Maximum | 528657 |
| Range | 42978 |
| Interquartile range (IQR) | 21474.75 |
Descriptive statistics
| Standard deviation | 12408.581 |
|---|---|
| Coefficient of variation (CV) | 0.024464693 |
| Kurtosis | -1.1983697 |
| Mean | 507203.62 |
| Median Absolute Deviation (MAD) | 10725 |
| Skewness | -0.004342852 |
| Sum | 2.6726588 × 10^10 |
| Variance | 1.5397288 × 10^8 |
| Monotonicity | Not monotonic |
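Several of the Patient_ID figures are derived from the others: the range and IQR come from the quantiles, and the coefficient of variation and variance come from the mean and standard deviation. A small sketch that recomputes them from the reported values (inputs are rounded as printed, so the results agree only to that precision):

```python
# Values copied from the Patient_ID tables.
minimum, maximum = 485679, 528657
q1, q3 = 496444.25, 517919
mean, std = 507203.62, 12408.581

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance

print(value_range, iqr, round(cv, 9), f"{variance:.7e}")
```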
| Value | Count | Frequency (%) |
| 516956 | 25 | < 0.1% |
| 509188 | 22 | < 0.1% |
| 490196 | 21 | < 0.1% |
| 513633 | 21 | < 0.1% |
| 502457 | 21 | < 0.1% |
| 496296 | 20 | < 0.1% |
| 517006 | 19 | < 0.1% |
| 495998 | 19 | < 0.1% |
| 512069 | 19 | < 0.1% |
| 492396 | 18 | < 0.1% |
| Other values (24622) | 52489 | 99.6% |
| Value | Count | Frequency (%) |
| 485679 | 2 | < 0.1% |
| 485681 | 1 | < 0.1% |
| 485685 | 1 | < 0.1% |
| 485686 | 1 | < 0.1% |
| 485690 | 2 | < 0.1% |
| 485691 | 1 | < 0.1% |
| 485697 | 1 | < 0.1% |
| 485698 | 2 | < 0.1% |
| 485699 | 1 | < 0.1% |
| 485701 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 528657 | 5 | < 0.1% |
| 528655 | 2 | < 0.1% |
| 528650 | 2 | < 0.1% |
| 528649 | 3 | < 0.1% |
| 528648 | 1 | < 0.1% |
| 528647 | 3 | < 0.1% |
| 528646 | 1 | < 0.1% |
| 528645 | 3 | < 0.1% |
| 528643 | 1 | < 0.1% |
| 528642 | 1 | < 0.1% |
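The Frequency (%) column in these value tables is the count divided by the 52694 observations, shown to one decimal place, with very small shares displayed as "< 0.1%". A minimal sketch of that formatting; `format_share` is a hypothetical helper, and the clamping thresholds are an assumption chosen to match the entries shown in this report:

```python
def format_share(count, total):
    """Format count/total as a percentage with one decimal place,
    clamping shares that would otherwise display as 0.0% or 100.0%."""
    share = count / total
    if share > 0 and round(share, 3) == 0:
        return "< 0.1%"
    if share < 1 and round(share, 3) == 1:
        return "> 99.9%"
    return f"{share * 100:.1f}%"

n_observations = 52694
print(format_share(25, n_observations))     # most common Patient_ID
print(format_share(52489, n_observations))  # all other Patient_ID values
```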
Health_Camp_ID
Real number (ℝ)
| Distinct | 44 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6547.5982 |
| Minimum | 6523 |
|---|---|
| Maximum | 6587 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 6523 |
|---|---|
| 5-th percentile | 6526 |
| Q1 | 6534 |
| median | 6541 |
| Q3 | 6562 |
| 95-th percentile | 6585 |
| Maximum | 6587 |
| Range | 64 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 19.265787 |
|---|---|
| Coefficient of variation (CV) | 0.0029424204 |
| Kurtosis | -0.87012188 |
| Mean | 6547.5982 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.70791228 |
| Sum | 3.4501914 × 10^8 |
| Variance | 371.17053 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6543 | 4617 | 8.8% |
| 6527 | 2865 | 5.4% |
| 6538 | 2742 | 5.2% |
| 6537 | 2716 | 5.2% |
| 6529 | 2662 | 5.1% |
| 6526 | 2646 | 5.0% |
| 6534 | 2529 | 4.8% |
| 6570 | 2520 | 4.8% |
| 6580 | 2450 | 4.6% |
| 6578 | 1966 | 3.7% |
| Other values (34) | 24981 | 47.4% |
| Value | Count | Frequency (%) |
| 6523 | 1464 | 2.8% |
| 6524 | 98 | 0.2% |
| 6526 | 2646 | 5.0% |
| 6527 | 2865 | 5.4% |
| 6528 | 1245 | 2.4% |
| 6529 | 2662 | 5.1% |
| 6530 | 203 | 0.4% |
| 6531 | 86 | 0.2% |
| 6532 | 1449 | 2.7% |
| 6534 | 2529 | 4.8% |
| Value | Count | Frequency (%) |
| 6587 | 47 | 0.1% |
| 6586 | 1818 | 3.5% |
| 6585 | 998 | 1.9% |
| 6581 | 1055 | 2.0% |
| 6580 | 2450 | 4.6% |
| 6578 | 1966 | 3.7% |
| 6575 | 63 | 0.1% |
| 6571 | 1462 | 2.8% |
| 6570 | 2520 | 4.8% |
| 6569 | 134 | 0.3% |
Registration_Date
Categorical
| Distinct | 1201 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 231 |
| Missing (%) | 0.4% |
| Memory size | 411.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 419704 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 13 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 14/05/05 |
|---|---|
| 2nd row | 26/05/06 |
| 3rd row | 07/01/04 |
| 4th row | 12/02/04 |
| 5th row | 14/03/04 |
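Registration_Date is profiled as a categorical string, which is why it triggers the high-cardinality alert. The sample values look like day-first dd/mm/yy dates (the first field reaches 31, as in 31/03/06), so they could be parsed into real dates before further analysis. A minimal sketch under that assumption; `parse_registration_date` is a hypothetical helper, and the mapping of two-digit years to the 2000s follows Python's `%y` convention rather than anything stated in the report:

```python
from datetime import date, datetime

def parse_registration_date(text):
    """Parse a dd/mm/yy string; return None for missing or empty input."""
    if text is None or not text.strip():
        return None
    return datetime.strptime(text.strip(), "%d/%m/%y").date()

# The five sample rows shown in the table above.
samples = ["14/05/05", "26/05/06", "07/01/04", "12/02/04", "14/03/04"]
parsed = [parse_registration_date(s) for s in samples]
print(min(parsed), max(parsed))
```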
Common Values
| Value | Count | Frequency (%) |
| 28/03/06 | 611 | 1.2% |
| 08/05/05 | 407 | 0.8% |
| 31/03/06 | 357 | 0.7% |
| 24/12/04 | 342 | 0.6% |
| 05/01/05 | 331 | 0.6% |
| 24/05/05 | 326 | 0.6% |
| 14/05/05 | 324 | 0.6% |
| 30/03/06 | 305 | 0.6% |
| 18/06/05 | 290 | 0.6% |
| 07/05/05 | 256 | 0.5% |
| Other values (1191) | 48914 | 92.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 115669 | |
| / | 104926 | |
| 1 | 47449 | |
| 5 | 36623 | 8.7% |
| 2 | 31268 | 7.5% |
| 4 | 21655 | 5.2% |
| 6 | 20644 | 4.9% |
| 3 | 14787 | 3.5% |
| 8 | 9504 | 2.3% |
| 9 | 8861 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 314778 | 75.0% |
| Other Punctuation | 104926 | 25.0% |
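The category names in this table correspond to Unicode general categories: digits are "Decimal Number" (`Nd`) and the slash is "Other Punctuation" (`Po`). Each 8-character date holds 6 digits and 2 slashes, so the 52463 non-missing rows give 314778 digits and 104926 slashes. A short sketch of how such counts can be produced with the standard library; the name mapping is illustrative and covers only the two categories seen in this column:

```python
import unicodedata
from collections import Counter

# Readable names for the two general categories seen in this column.
CATEGORY_NAMES = {"Nd": "Decimal Number", "Po": "Other Punctuation"}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return dict(counts)

print(category_counts(["14/05/05", "26/05/06"]))
```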
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 115669 | |
| 1 | 47449 | |
| 5 | 36623 | 11.6% |
| 2 | 31268 | 9.9% |
| 4 | 21655 | 6.9% |
| 6 | 20644 | 6.6% |
| 3 | 14787 | 4.7% |
| 8 | 9504 | 3.0% |
| 9 | 8861 | 2.8% |
| 7 | 8318 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 104926 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 419704 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 115669 | |
| / | 104926 | |
| 1 | 47449 | |
| 5 | 36623 | 8.7% |
| 2 | 31268 | 7.5% |
| 4 | 21655 | 5.2% |
| 6 | 20644 | 4.9% |
| 3 | 14787 | 3.5% |
| 8 | 9504 | 2.3% |
| 9 | 8861 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 419704 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 115669 | |
| / | 104926 | |
| 1 | 47449 | |
| 5 | 36623 | 8.7% |
| 2 | 31268 | 7.5% |
| 4 | 21655 | 5.2% |
| 6 | 20644 | 4.9% |
| 3 | 14787 | 3.5% |
| 8 | 9504 | 2.3% |
| 9 | 8861 | 2.1% |
Var1
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 120 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.84026644 |
| Minimum | 0 |
|---|---|
| Maximum | 288 |
| Zeros | 47284 |
| Zeros (%) | 89.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3 |
| Maximum | 288 |
| Range | 288 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 8.0819472 |
|---|---|
| Coefficient of variation (CV) | 9.6183148 |
| Kurtosis | 510.74627 |
| Mean | 0.84026644 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.762832 |
| Sum | 44277 |
| Variance | 65.31787 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 47284 | 89.7% |
| 1 | 1733 | 3.3% |
| 2 | 971 | 1.8% |
| 3 | 756 | 1.4% |
| 4 | 378 | 0.7% |
| 5 | 263 | 0.5% |
| 6 | 150 | 0.3% |
| 7 | 134 | 0.3% |
| 8 | 99 | 0.2% |
| 9 | 94 | 0.2% |
| Other values (110) | 832 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 47284 | 89.7% |
| 1 | 1733 | 3.3% |
| 2 | 971 | 1.8% |
| 3 | 756 | 1.4% |
| 4 | 378 | 0.7% |
| 5 | 263 | 0.5% |
| 6 | 150 | 0.3% |
| 7 | 134 | 0.3% |
| 8 | 99 | 0.2% |
| 9 | 94 | 0.2% |
| Value | Count | Frequency (%) |
| 288 | 2 | < 0.1% |
| 286 | 1 | < 0.1% |
| 277 | 1 | < 0.1% |
| 271 | 1 | < 0.1% |
| 259 | 1 | < 0.1% |
| 252 | 1 | < 0.1% |
| 243 | 1 | < 0.1% |
| 238 | 2 | < 0.1% |
| 236 | 1 | < 0.1% |
| 227 | 1 | < 0.1% |
Var2
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 71 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.25849243 |
| Minimum | 0 |
|---|---|
| Maximum | 156 |
| Zeros | 50989 |
| Zeros (%) | 96.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 156 |
| Range | 156 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.9826786 |
|---|---|
| Coefficient of variation (CV) | 15.407332 |
| Kurtosis | 822.24522 |
| Mean | 0.25849243 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 26.92695 |
| Sum | 13621 |
| Variance | 15.861729 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 50989 | 96.8% |
| 1 | 680 | 1.3% |
| 2 | 358 | 0.7% |
| 3 | 190 | 0.4% |
| 4 | 70 | 0.1% |
| 5 | 54 | 0.1% |
| 6 | 41 | 0.1% |
| 9 | 29 | 0.1% |
| 7 | 23 | < 0.1% |
| 10 | 22 | < 0.1% |
| Other values (61) | 238 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 50989 | 96.8% |
| 1 | 680 | 1.3% |
| 2 | 358 | 0.7% |
| 3 | 190 | 0.4% |
| 4 | 70 | 0.1% |
| 5 | 54 | 0.1% |
| 6 | 41 | 0.1% |
| 7 | 23 | < 0.1% |
| 8 | 14 | < 0.1% |
| 9 | 29 | 0.1% |
| Value | Count | Frequency (%) |
| 156 | 1 | < 0.1% |
| 150 | 2 | < 0.1% |
| 148 | 1 | < 0.1% |
| 147 | 4 | < 0.1% |
| 141 | 1 | < 0.1% |
| 138 | 1 | < 0.1% |
| 133 | 2 | < 0.1% |
| 131 | 2 | < 0.1% |
| 129 | 1 | < 0.1% |
| 123 | 8 | < 0.1% |
Var3
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 52694 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 52672 | > 99.9% |
| 1 | 22 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 52672 | |
| 1 | 22 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52694 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 52672 | |
| 1 | 22 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 52694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 52672 | |
| 1 | 22 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 52672 | |
| 1 | 22 | < 0.1% |
Var4
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 52694 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 51891 | 98.5% |
| 1 | 503 | 1.0% |
| 2 | 226 | 0.4% |
| 3 | 63 | 0.1% |
| 4 | 11 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 51891 | |
| 1 | 503 | 1.0% |
| 2 | 226 | 0.4% |
| 3 | 63 | 0.1% |
| 4 | 11 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52694 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 51891 | |
| 1 | 503 | 1.0% |
| 2 | 226 | 0.4% |
| 3 | 63 | 0.1% |
| 4 | 11 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 52694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 51891 | |
| 1 | 503 | 1.0% |
| 2 | 226 | 0.4% |
| 3 | 63 | 0.1% |
| 4 | 11 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 51891 | |
| 1 | 503 | 1.0% |
| 2 | 226 | 0.4% |
| 3 | 63 | 0.1% |
| 4 | 11 | < 0.1% |
Var5
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2531218 |
| Minimum | 0 |
|---|---|
| Maximum | 31 |
| Zeros | 48546 |
| Zeros (%) | 92.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 31 |
| Range | 31 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.2510854 |
|---|---|
| Coefficient of variation (CV) | 4.9426223 |
| Kurtosis | 115.60937 |
| Mean | 0.2531218 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.9185138 |
| Sum | 13338 |
| Variance | 1.5652148 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 48546 | 92.1% |
| 1 | 1481 | 2.8% |
| 2 | 791 | 1.5% |
| 3 | 654 | 1.2% |
| 4 | 375 | 0.7% |
| 5 | 231 | 0.4% |
| 6 | 155 | 0.3% |
| 7 | 146 | 0.3% |
| 8 | 102 | 0.2% |
| 10 | 44 | 0.1% |
| Other values (20) | 169 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 48546 | 92.1% |
| 1 | 1481 | 2.8% |
| 2 | 791 | 1.5% |
| 3 | 654 | 1.2% |
| 4 | 375 | 0.7% |
| 5 | 231 | 0.4% |
| 6 | 155 | 0.3% |
| 7 | 146 | 0.3% |
| 8 | 102 | 0.2% |
| 9 | 41 | 0.1% |
| Value | Count | Frequency (%) |
| 31 | 1 | < 0.1% |
| 29 | 3 | < 0.1% |
| 27 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 4 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 8 | < 0.1% |
| 21 | 3 | < 0.1% |
| 20 | 9 | < 0.1% |
outcome
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 52694 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 38354 | 72.8% |
| 1 | 14340 | 27.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 38354 | |
| 1 | 14340 | 27.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52694 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 38354 | |
| 1 | 14340 | 27.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 52694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 38354 | |
| 1 | 14340 | 27.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 38354 | |
| 1 | 14340 | 27.2% |
Online_Follower
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 52694 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 49762 | 94.4% |
| 1 | 2932 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 49762 | |
| 1 | 2932 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52694 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 49762 | |
| 1 | 2932 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 52694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 49762 | |
| 1 | 2932 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 49762 | |
| 1 | 2932 | 5.6% |
LinkedIn_Shared
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 52694 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 49258 | 93.5% |
| 1 | 3436 | 6.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 49258 | |
| 1 | 3436 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52694 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 49258 | |
| 1 | 3436 | 6.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 52694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 49258 | |
| 1 | 3436 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 49258 | |
| 1 | 3436 | 6.5% |
Twitter_Shared
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 52694 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 49904 | 94.7% |
| 1 | 2790 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 49904 | |
| 1 | 2790 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52694 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 49904 | |
| 1 | 2790 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 52694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 49904 | |
| 1 | 2790 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 49904 | |
| 1 | 2790 | 5.3% |
Facebook_Shared
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 52694 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 49787 | 94.5% |
| 1 | 2907 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 49787 | |
| 1 | 2907 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52694 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 49787 | |
| 1 | 2907 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 52694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 49787 | |
| 1 | 2907 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 49787 | |
| 1 | 2907 | 5.5% |
Income
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.1347402 |
| Min length | 1 |
Characters and Unicode
| Total characters | 165182 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | None |
|---|---|
| 2nd row | None |
| 3rd row | None |
| 4th row | None |
| 5th row | None |
Common Values
| Value | Count | Frequency (%) |
| None | 37496 | 71.2% |
| 0 | 5882 | 11.2% |
| 1 | 3726 | 7.1% |
| 2 | 2830 | 5.4% |
| 3 | 1535 | 2.9% |
| 4 | 743 | 1.4% |
| 5 | 300 | 0.6% |
| 6 | 182 | 0.3% |
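Income is reported as categorical with 0 missing values, yet the literal string "None" accounts for 37496 rows. This suggests that missing values were stored as text, which also forces the remaining numeric codes to be treated as strings. A minimal sketch of converting such a column back to optional numbers; `parse_optional_number` and the five-row `raw_income` sample are hypothetical, introduced only for illustration:

```python
def parse_optional_number(text, sentinel="None"):
    """Convert a string to float, treating the sentinel, empty strings,
    and None as missing (returned as None)."""
    if text is None:
        return None
    cleaned = text.strip()
    if cleaned == "" or cleaned == sentinel:
        return None
    return float(cleaned)

# Illustrative values only, not rows from the dataset.
raw_income = ["None", "0", "3", "None", "6"]
income = [parse_optional_number(v) for v in raw_income]

# Share of rows that are actually missing once the sentinel is recognised.
missing_share = sum(v is None for v in income) / len(income)
print(income, missing_share)
```

The same treatment would apply to Education_Score and Age, which show the same "None" pattern; Education_Score also contains decimal points, so parsing to float rather than int is the safer choice.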
Most occurring characters
| Value | Count | Frequency (%) |
| N | 37496 | |
| o | 37496 | |
| n | 37496 | |
| e | 37496 | |
| 0 | 5882 | 3.6% |
| 1 | 3726 | 2.3% |
| 2 | 2830 | 1.7% |
| 3 | 1535 | 0.9% |
| 4 | 743 | 0.4% |
| 5 | 300 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 112488 | |
| Uppercase Letter | 37496 | 22.7% |
| Decimal Number | 15198 | 9.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5882 | |
| 1 | 3726 | |
| 2 | 2830 | |
| 3 | 1535 | 10.1% |
| 4 | 743 | 4.9% |
| 5 | 300 | 2.0% |
| 6 | 182 | 1.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 37496 | |
| n | 37496 | |
| e | 37496 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 37496 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 149984 | |
| Common | 15198 | 9.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5882 | |
| 1 | 3726 | |
| 2 | 2830 | |
| 3 | 1535 | 10.1% |
| 4 | 743 | 4.9% |
| 5 | 300 | 2.0% |
| 6 | 182 | 1.2% |
Latin
| Value | Count | Frequency (%) |
| N | 37496 | |
| o | 37496 | |
| n | 37496 | |
| e | 37496 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 165182 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 37496 | |
| o | 37496 | |
| n | 37496 | |
| e | 37496 | |
| 0 | 5882 | 3.6% |
| 1 | 3726 | 2.3% |
| 2 | 2830 | 1.7% |
| 3 | 1535 | 0.9% |
| 4 | 743 | 0.4% |
| 5 | 300 | 0.2% |
Education_Score
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 4 |
| Mean length | 3.7983452 |
| Min length | 2 |
Characters and Unicode
| Total characters | 200150 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 43 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | None |
|---|---|
| 2nd row | None |
| 3rd row | None |
| 4th row | None |
| 5th row | None |
Common Values
| Value | Count | Frequency (%) |
| None | 45711 | 86.7% |
| 82 | 343 | 0.7% |
| 79 | 307 | 0.6% |
| 75 | 285 | 0.5% |
| 86 | 280 | 0.5% |
| 87 | 267 | 0.5% |
| 76 | 263 | 0.5% |
| 77 | 245 | 0.5% |
| 89 | 244 | 0.5% |
| 80 | 244 | 0.5% |
| Other values (201) | 4505 | 8.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 45711 | |
| o | 45711 | |
| n | 45711 | |
| e | 45711 | |
| 7 | 3477 | 1.7% |
| 8 | 3403 | 1.7% |
| 6 | 2569 | 1.3% |
| 9 | 1547 | 0.8% |
| 3 | 1485 | 0.7% |
| 5 | 996 | 0.5% |
| Other values (5) | 3829 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 137133 | |
| Uppercase Letter | 45711 | 22.8% |
| Decimal Number | 16695 | 8.3% |
| Other Punctuation | 611 | 0.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 3477 | |
| 8 | 3403 | |
| 6 | 2569 | |
| 9 | 1547 | |
| 3 | 1485 | |
| 5 | 996 | 6.0% |
| 2 | 952 | 5.7% |
| 0 | 817 | 4.9% |
| 4 | 730 | 4.4% |
| 1 | 719 | 4.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 45711 | |
| n | 45711 | |
| e | 45711 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 45711 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 611 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 182844 | |
| Common | 17306 | 8.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 3477 | |
| 8 | 3403 | |
| 6 | 2569 | |
| 9 | 1547 | |
| 3 | 1485 | |
| 5 | 996 | 5.8% |
| 2 | 952 | 5.5% |
| 0 | 817 | 4.7% |
| 4 | 730 | 4.2% |
| 1 | 719 | 4.2% |
Latin
| Value | Count | Frequency (%) |
| N | 45711 | |
| o | 45711 | |
| n | 45711 | |
| e | 45711 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200150 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 45711 | |
| o | 45711 | |
| n | 45711 | |
| e | 45711 | |
| 7 | 3477 | 1.7% |
| 8 | 3403 | 1.7% |
| 6 | 2569 | 1.3% |
| 9 | 1547 | 0.8% |
| 3 | 1485 | 0.7% |
| 5 | 996 | 0.5% |
| Other values (5) | 3829 | 1.9% |
Age
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.3704786 |
| Min length | 2 |
Characters and Unicode
| Total characters | 177604 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | None |
|---|---|
| 2nd row | None |
| 3rd row | None |
| 4th row | None |
| 5th row | None |
Common Values
| Value | Count | Frequency (%) |
| None | 36108 | 68.5% |
| 41 | 1273 | 2.4% |
| 40 | 1210 | 2.3% |
| 42 | 1186 | 2.3% |
| 43 | 1151 | 2.2% |
| 39 | 1036 | 2.0% |
| 44 | 1021 | 1.9% |
| 45 | 778 | 1.5% |
| 46 | 734 | 1.4% |
| 37 | 732 | 1.4% |
| Other values (40) | 7465 | 14.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 36108 | 20.3% |
| o | 36108 | 20.3% |
| n | 36108 | 20.3% |
| e | 36108 | 20.3% |
| 4 | 10673 | 6.0% |
| 3 | 5183 | 2.9% |
| 7 | 3888 | 2.2% |
| 5 | 2959 | 1.7% |
| 2 | 2060 | 1.2% |
| 1 | 2005 | 1.1% |
| Other values (4) | 6404 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 108324 | 61.0% |
| Uppercase Letter | 36108 | 20.3% |
| Decimal Number | 33172 | 18.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 10673 | |
| 3 | 5183 | |
| 7 | 3888 | 11.7% |
| 5 | 2959 | 8.9% |
| 2 | 2060 | 6.2% |
| 1 | 2005 | 6.0% |
| 0 | 1814 | 5.5% |
| 9 | 1643 | 5.0% |
| 6 | 1631 | 4.9% |
| 8 | 1316 | 4.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 36108 | |
| n | 36108 | |
| e | 36108 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 36108 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 144432 | 81.3% |
| Common | 33172 | 18.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 10673 | |
| 3 | 5183 | |
| 7 | 3888 | 11.7% |
| 5 | 2959 | 8.9% |
| 2 | 2060 | 6.2% |
| 1 | 2005 | 6.0% |
| 0 | 1814 | 5.5% |
| 9 | 1643 | 5.0% |
| 6 | 1631 | 4.9% |
| 8 | 1316 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| N | 36108 | |
| o | 36108 | |
| n | 36108 | |
| e | 36108 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 177604 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 36108 | 20.3% |
| o | 36108 | 20.3% |
| n | 36108 | 20.3% |
| e | 36108 | 20.3% |
| 4 | 10673 | 6.0% |
| 3 | 5183 | 2.9% |
| 7 | 3888 | 2.2% |
| 5 | 2959 | 1.7% |
| 2 | 2060 | 1.2% |
| 1 | 2005 | 1.1% |
| Other values (4) | 6404 | 3.6% |
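Age is profiled as Categorical with 0 missing cells, yet its most common value is the literal string `None` (36108 of 52694 rows). The sketch below shows one way such a column could be decoded into integers before further analysis. It assumes that `None` is a placeholder for an unknown age, which the report itself does not state, and the helper `parse_age` and the sample list `raw_ages` are hypothetical.

```python
from typing import List, Optional


def parse_age(raw: str) -> Optional[int]:
    """Return the age as an int, or None when the cell holds a placeholder."""
    text = raw.strip()
    # Assumption: any non-digit string (such as "None") means the age is unknown.
    return int(text) if text.isdigit() else None


# Hypothetical sample shaped like the Age column in this report.
raw_ages: List[str] = ["None", "41", "40", "None", "42"]
ages = [parse_age(value) for value in raw_ages]

# Share of rows that are effectively missing once the placeholder is decoded.
missing_share = sum(age is None for age in ages) / len(ages)
print(ages, missing_share)
```

Applied to the full column, the same decoding would likely report about 68.5% of the ages as missing (36108 / 52694) rather than 0.0%.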
First_Interaction
Categorical
| Distinct | 1468 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
| 08-Sep-04 | 1064 |
|---|---|
| 01-May-05 | 870 |
| 08-Feb-03 | 687 |
| 21-May-05 | 635 |
| 25-Oct-04 | 610 |
| Other values (1463) | 48828 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 474246 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 30 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 14-Nov-04 |
|---|---|
| 2nd row | 26-May-06 |
| 3rd row | 07-Jan-04 |
| 4th row | 12-Feb-04 |
| 5th row | 14-Mar-04 |
Common Values
| Value | Count | Frequency (%) |
| 08-Sep-04 | 1064 | 2.0% |
| 01-May-05 | 870 | 1.7% |
| 08-Feb-03 | 687 | 1.3% |
| 21-May-05 | 635 | 1.2% |
| 25-Oct-04 | 610 | 1.2% |
| 09-Feb-05 | 567 | 1.1% |
| 03-Oct-04 | 528 | 1.0% |
| 11-Dec-04 | 507 | 1.0% |
| 21-Sep-04 | 485 | 0.9% |
| 10-May-05 | 437 | 0.8% |
| Other values (1458) | 46304 | 87.9% |
Words
| Value | Count | Frequency (%) |
| 08-sep-04 | 1064 | 2.0% |
| 01-may-05 | 870 | 1.7% |
| 08-feb-03 | 687 | 1.3% |
| 21-may-05 | 635 | 1.2% |
| 25-oct-04 | 610 | 1.2% |
| 09-feb-05 | 567 | 1.1% |
| 03-oct-04 | 528 | 1.0% |
| 11-dec-04 | 507 | 1.0% |
| 21-sep-04 | 485 | 0.9% |
| 10-may-05 | 437 | 0.8% |
| Other values (1458) | 46304 | 87.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 105388 | |
| 0 | 74974 | |
| 1 | 23358 | 4.9% |
| 4 | 22788 | 4.8% |
| 2 | 21914 | 4.6% |
| 5 | 21741 | 4.6% |
| 3 | 18237 | 3.8% |
| e | 14010 | 3.0% |
| a | 13657 | 2.9% |
| J | 13151 | 2.8% |
| Other values (23) | 145028 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 210776 | |
| Dash Punctuation | 105388 | |
| Lowercase Letter | 105388 | |
| Uppercase Letter | 52694 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14010 | |
| a | 13657 | |
| u | 11934 | |
| n | 9917 | |
| c | 9794 | |
| p | 7848 | |
| r | 6963 | |
| t | 4915 | 4.7% |
| y | 4866 | 4.6% |
| b | 4784 | 4.5% |
| Other values (4) | 16700 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 74974 | |
| 1 | 23358 | 11.1% |
| 4 | 22788 | 10.8% |
| 2 | 21914 | 10.4% |
| 5 | 21741 | 10.3% |
| 3 | 18237 | 8.7% |
| 6 | 11075 | 5.3% |
| 8 | 6860 | 3.3% |
| 7 | 5048 | 2.4% |
| 9 | 4781 | 2.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 13151 | |
| M | 8328 | |
| A | 7613 | |
| O | 4915 | 9.3% |
| D | 4879 | 9.3% |
| F | 4784 | 9.1% |
| N | 4677 | 8.9% |
| S | 4347 | 8.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 105388 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 316164 | |
| Latin | 158082 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 14010 | 8.9% |
| a | 13657 | 8.6% |
| J | 13151 | 8.3% |
| u | 11934 | 7.5% |
| n | 9917 | 6.3% |
| c | 9794 | 6.2% |
| M | 8328 | 5.3% |
| p | 7848 | 5.0% |
| A | 7613 | 4.8% |
| r | 6963 | 4.4% |
| Other values (12) | 54867 |
Common
| Value | Count | Frequency (%) |
| - | 105388 | |
| 0 | 74974 | |
| 1 | 23358 | 7.4% |
| 4 | 22788 | 7.2% |
| 2 | 21914 | 6.9% |
| 5 | 21741 | 6.9% |
| 3 | 18237 | 5.8% |
| 6 | 11075 | 3.5% |
| 8 | 6860 | 2.2% |
| 7 | 5048 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 474246 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 105388 | |
| 0 | 74974 | |
| 1 | 23358 | 4.9% |
| 4 | 22788 | 4.8% |
| 2 | 21914 | 4.6% |
| 5 | 21741 | 4.6% |
| 3 | 18237 | 3.8% |
| e | 14010 | 3.0% |
| a | 13657 | 2.9% |
| J | 13151 | 2.8% |
| Other values (23) | 145028 |
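First_Interaction holds dates as fixed-width 9-character strings such as `08-Sep-04`, which is why it is profiled as Categorical with high cardinality. Below is a minimal sketch of parsing that layout with Python's standard library, using three values from the Sample table above. The helper name `parse_report_date` is hypothetical. The sketch assumes English month abbreviations (`%b` is locale-dependent) and relies on `%y` mapping two-digit years 00 to 68 onto 2000 to 2068.

```python
from datetime import datetime

# Day, abbreviated month name, two-digit year, e.g. "08-Sep-04".
DATE_FORMAT = "%d-%b-%y"


def parse_report_date(text: str) -> datetime:
    """Parse a date string in the dd-Mon-yy layout used by this dataset."""
    return datetime.strptime(text, DATE_FORMAT)


# The first three values from the Sample table of First_Interaction.
samples = ["14-Nov-04", "26-May-06", "07-Jan-04"]
parsed = [parse_report_date(s) for s in samples]
iso_dates = [d.date().isoformat() for d in parsed]
print(iso_dates)
```

Once parsed, the values can be sorted or differenced as real dates rather than compared as text.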
City_Type
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 23236 |
| Missing (%) | 44.1% |
| Memory size | 411.8 KiB |
| B | 5814 |
|---|---|
| H | 4267 |
| D | 3811 |
| G | 2990 |
| C | 2962 |
| Other values (4) | 9614 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29458 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | H |
|---|---|
| 2nd row | H |
| 3rd row | B |
| 4th row | B |
| 5th row | G |
Common Values
| Value | Count | Frequency (%) |
| B | 5814 | 11.0% |
| H | 4267 | 8.1% |
| D | 3811 | 7.2% |
| G | 2990 | 5.7% |
| C | 2962 | 5.6% |
| E | 2849 | 5.4% |
| A | 2413 | 4.6% |
| I | 2370 | 4.5% |
| F | 1982 | 3.8% |
| (Missing) | 23236 | 44.1% |
Words
| Value | Count | Frequency (%) |
| b | 5814 | 19.7% |
| h | 4267 | 14.5% |
| d | 3811 | 12.9% |
| g | 2990 | 10.2% |
| c | 2962 | 10.1% |
| e | 2849 | 9.7% |
| a | 2413 | 8.2% |
| i | 2370 | 8.0% |
| f | 1982 | 6.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 5814 | 19.7% |
| H | 4267 | 14.5% |
| D | 3811 | 12.9% |
| G | 2990 | 10.2% |
| C | 2962 | 10.1% |
| E | 2849 | 9.7% |
| A | 2413 | 8.2% |
| I | 2370 | 8.0% |
| F | 1982 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 29458 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 5814 | 19.7% |
| H | 4267 | 14.5% |
| D | 3811 | 12.9% |
| G | 2990 | 10.2% |
| C | 2962 | 10.1% |
| E | 2849 | 9.7% |
| A | 2413 | 8.2% |
| I | 2370 | 8.0% |
| F | 1982 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29458 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 5814 | 19.7% |
| H | 4267 | 14.5% |
| D | 3811 | 12.9% |
| G | 2990 | 10.2% |
| C | 2962 | 10.1% |
| E | 2849 | 9.7% |
| A | 2413 | 8.2% |
| I | 2370 | 8.0% |
| F | 1982 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29458 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 5814 | 19.7% |
| H | 4267 | 14.5% |
| D | 3811 | 12.9% |
| G | 2990 | 10.2% |
| C | 2962 | 10.1% |
| E | 2849 | 9.7% |
| A | 2413 | 8.2% |
| I | 2370 | 8.0% |
| F | 1982 | 6.7% |
Employer_Category
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 42095 |
| Missing (%) | 79.9% |
| Memory size | 411.8 KiB |
| Technology | 2505 |
|---|---|
| Software Industry | 1645 |
| Others | 1537 |
| Consulting | 1508 |
| Education | 774 |
| Other values (9) | 2630 |
Length
| Max length | 17 |
|---|---|
| Median length | 13 |
| Mean length | 9.8138504 |
| Min length | 4 |
Characters and Unicode
| Total characters | 104017 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Broadcasting |
|---|---|
| 2nd row | Broadcasting |
| 3rd row | Education |
| 4th row | Retail |
| 5th row | BFSI |
Common Values
| Value | Count | Frequency (%) |
| Technology | 2505 | 4.8% |
| Software Industry | 1645 | 3.1% |
| Others | 1537 | 2.9% |
| Consulting | 1508 | 2.9% |
| Education | 774 | 1.5% |
| BFSI | 569 | 1.1% |
| Retail | 360 | 0.7% |
| Manufacturing | 360 | 0.7% |
| Health | 309 | 0.6% |
| Transport | 256 | 0.5% |
| Other values (4) | 776 | 1.5% |
| (Missing) | 42095 | 79.9% |
Words
| Value | Count | Frequency (%) |
| technology | 2505 | 20.0% |
| software | 1645 | 13.2% |
| industry | 1645 | 13.2% |
| others | 1537 | 12.3% |
| consulting | 1508 | 12.1% |
| education | 774 | 6.2% |
| bfsi | 569 | 4.6% |
| retail | 360 | 2.9% |
| manufacturing | 360 | 2.9% |
| health | 309 | 2.5% |
| Other values (6) | 1286 | 10.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 9931 | 9.5% |
| n | 9048 | 8.7% |
| t | 9034 | 8.7% |
| e | 7212 | 6.9% |
| r | 5831 | 5.6% |
| s | 5332 | 5.1% |
| l | 5110 | 4.9% |
| a | 4836 | 4.6% |
| u | 4647 | 4.5% |
| g | 4505 | 4.3% |
| Other values (21) | 38531 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 87913 | |
| Uppercase Letter | 14205 | 13.7% |
| Space Separator | 1899 | 1.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 9931 | |
| n | 9048 | |
| t | 9034 | |
| e | 7212 | 8.2% |
| r | 5831 | 6.6% |
| s | 5332 | 6.1% |
| l | 5110 | 5.8% |
| a | 4836 | 5.5% |
| u | 4647 | 5.3% |
| g | 4505 | 5.1% |
| Other values (9) | 22427 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2935 | |
| I | 2214 | |
| S | 2214 | |
| O | 1537 | |
| C | 1508 | |
| E | 1028 | 7.2% |
| F | 785 | 5.5% |
| B | 701 | 4.9% |
| R | 614 | 4.3% |
| M | 360 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| 1899 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 102118 | |
| Common | 1899 | 1.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 9931 | 9.7% |
| n | 9048 | 8.9% |
| t | 9034 | 8.8% |
| e | 7212 | 7.1% |
| r | 5831 | 5.7% |
| s | 5332 | 5.2% |
| l | 5110 | 5.0% |
| a | 4836 | 4.7% |
| u | 4647 | 4.6% |
| g | 4505 | 4.4% |
| Other values (20) | 36632 |
Common
| Value | Count | Frequency (%) |
| 1899 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104017 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 9931 | 9.5% |
| n | 9048 | 8.7% |
| t | 9034 | 8.7% |
| e | 7212 | 6.9% |
| r | 5831 | 5.6% |
| s | 5332 | 5.1% |
| l | 5110 | 4.9% |
| a | 4836 | 4.6% |
| u | 4647 | 4.5% |
| g | 4505 | 4.3% |
| Other values (21) | 38531 |
Camp_Start_Date
Categorical
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
| 27-Sep-05 | 7333 |
|---|---|
| 19-Feb-05 | 3074 |
| 09-Jan-04 | 3025 |
| 13-Jun-05 | 2865 |
| 30-Mar-06 | 2662 |
| Other values (35) | 33735 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 474246 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 09-Jul-05 |
|---|---|
| 2nd row | 17-Oct-05 |
| 3rd row | 04-Jan-04 |
| 4th row | 01-Feb-04 |
| 5th row | 07-Dec-03 |
Common Values
| Value | Count | Frequency (%) |
| 27-Sep-05 | 7333 | 13.9% |
| 19-Feb-05 | 3074 | 5.8% |
| 09-Jan-04 | 3025 | 5.7% |
| 13-Jun-05 | 2865 | 5.4% |
| 30-Mar-06 | 2662 | 5.1% |
| 03-Jan-05 | 2646 | 5.0% |
| 17-Oct-05 | 2529 | 4.8% |
| 09-Jul-05 | 2520 | 4.8% |
| 22-Dec-04 | 2450 | 4.6% |
| 16-Aug-05 | 1966 | 3.7% |
| Other values (30) | 21624 | 41.0% |
Words
| Value | Count | Frequency (%) |
| 27-sep-05 | 7333 | 13.9% |
| 19-feb-05 | 3074 | 5.8% |
| 09-jan-04 | 3025 | 5.7% |
| 13-jun-05 | 2865 | 5.4% |
| 30-mar-06 | 2662 | 5.1% |
| 03-jan-05 | 2646 | 5.0% |
| 17-oct-05 | 2529 | 4.8% |
| 09-jul-05 | 2520 | 4.8% |
| 22-dec-04 | 2450 | 4.6% |
| 16-aug-05 | 1966 | 3.7% |
| Other values (30) | 21624 | 41.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 105388 | |
| 0 | 74390 | |
| 5 | 35286 | 7.4% |
| e | 22308 | 4.7% |
| 1 | 21042 | 4.4% |
| 2 | 18984 | 4.0% |
| 3 | 15240 | 3.2% |
| 4 | 14893 | 3.1% |
| J | 14016 | 3.0% |
| 7 | 12752 | 2.7% |
| Other values (22) | 139947 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 210776 | |
| Dash Punctuation | 105388 | |
| Lowercase Letter | 105388 | |
| Uppercase Letter | 52694 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 22308 | |
| u | 12002 | |
| n | 11496 | |
| c | 10874 | |
| p | 8899 | 8.4% |
| b | 8521 | 8.1% |
| a | 8517 | 8.1% |
| t | 5872 | 5.6% |
| v | 3881 | 3.7% |
| o | 3881 | 3.7% |
| Other values (4) | 9137 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 74390 | |
| 5 | 35286 | |
| 1 | 21042 | 10.0% |
| 2 | 18984 | 9.0% |
| 3 | 15240 | 7.2% |
| 4 | 14893 | 7.1% |
| 7 | 12752 | 6.1% |
| 9 | 11958 | 5.7% |
| 6 | 6231 | 3.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 14016 | |
| S | 8785 | |
| F | 8521 | |
| O | 5872 | |
| D | 5002 | 9.5% |
| N | 3881 | 7.4% |
| A | 3805 | 7.2% |
| M | 2812 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 105388 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 316164 | |
| Latin | 158082 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 22308 | |
| J | 14016 | 8.9% |
| u | 12002 | 7.6% |
| n | 11496 | 7.3% |
| c | 10874 | 6.9% |
| p | 8899 | 5.6% |
| S | 8785 | 5.6% |
| b | 8521 | 5.4% |
| F | 8521 | 5.4% |
| a | 8517 | 5.4% |
| Other values (12) | 44143 |
Common
| Value | Count | Frequency (%) |
| - | 105388 | |
| 0 | 74390 | |
| 5 | 35286 | 11.2% |
| 1 | 21042 | 6.7% |
| 2 | 18984 | 6.0% |
| 3 | 15240 | 4.8% |
| 4 | 14893 | 4.7% |
| 7 | 12752 | 4.0% |
| 9 | 11958 | 3.8% |
| 6 | 6231 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 474246 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 105388 | |
| 0 | 74390 | |
| 5 | 35286 | 7.4% |
| e | 22308 | 4.7% |
| 1 | 21042 | 4.4% |
| 2 | 18984 | 4.0% |
| 3 | 15240 | 3.2% |
| 4 | 14893 | 3.1% |
| J | 14016 | 3.0% |
| 7 | 12752 | 2.7% |
| Other values (22) | 139947 |
Camp_End_Date
Categorical
| Distinct | 38 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
| 07-Nov-07 | 9862 |
|---|---|
| 22-Jul-05 | 5385 |
| 23-Aug-05 | 3074 |
| 04-Feb-05 | 2742 |
| 03-Apr-06 | 2662 |
| Other values (33) | 28969 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 474246 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 22-Jul-05 |
|---|---|
| 2nd row | 07-Nov-07 |
| 3rd row | 09-Jan-04 |
| 4th row | 18-Feb-04 |
| 5th row | 13-Jun-04 |
Common Values
| Value | Count | Frequency (%) |
| 07-Nov-07 | 9862 | 18.7% |
| 22-Jul-05 | 5385 | 10.2% |
| 23-Aug-05 | 3074 | 5.8% |
| 04-Feb-05 | 2742 | 5.2% |
| 03-Apr-06 | 2662 | 5.1% |
| 20-Feb-05 | 2646 | 5.0% |
| 06-Jan-05 | 2450 | 4.6% |
| 14-Oct-05 | 2029 | 3.9% |
| 18-Oct-04 | 1818 | 3.5% |
| 02-Jun-05 | 1659 | 3.1% |
| Other values (28) | 18367 | 34.9% |
Words
| Value | Count | Frequency (%) |
| 07-nov-07 | 9862 | 18.7% |
| 22-jul-05 | 5385 | 10.2% |
| 23-aug-05 | 3074 | 5.8% |
| 04-feb-05 | 2742 | 5.2% |
| 03-apr-06 | 2662 | 5.1% |
| 20-feb-05 | 2646 | 5.0% |
| 06-jan-05 | 2450 | 4.6% |
| 14-oct-05 | 2029 | 3.9% |
| 18-oct-04 | 1818 | 3.5% |
| 02-jun-05 | 1659 | 3.1% |
| Other values (28) | 18367 | 34.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 105388 | |
| 0 | 79822 | |
| 5 | 32290 | 6.8% |
| 2 | 24110 | 5.1% |
| 7 | 21331 | 4.5% |
| 1 | 16259 | 3.4% |
| J | 15203 | 3.2% |
| u | 14604 | 3.1% |
| e | 13622 | 2.9% |
| 4 | 12517 | 2.6% |
| Other values (23) | 139100 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 210776 | |
| Dash Punctuation | 105388 | |
| Lowercase Letter | 105388 | |
| Uppercase Letter | 52694 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 14604 | |
| e | 13622 | |
| o | 10949 | |
| v | 10949 | |
| b | 9485 | |
| n | 8184 | |
| l | 7019 | |
| p | 6823 | |
| c | 6733 | |
| t | 5512 | 5.2% |
| Other values (4) | 11508 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 79822 | |
| 5 | 32290 | |
| 2 | 24110 | 11.4% |
| 7 | 21331 | 10.1% |
| 1 | 16259 | 7.7% |
| 4 | 12517 | 5.9% |
| 3 | 9376 | 4.4% |
| 6 | 8890 | 4.2% |
| 8 | 4896 | 2.3% |
| 9 | 1285 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 15203 | |
| N | 10949 | |
| F | 9485 | |
| A | 7144 | |
| O | 5512 | 10.5% |
| S | 2916 | 5.5% |
| D | 1221 | 2.3% |
| M | 264 | 0.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 105388 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 316164 | |
| Latin | 158082 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| J | 15203 | 9.6% |
| u | 14604 | 9.2% |
| e | 13622 | 8.6% |
| o | 10949 | 6.9% |
| v | 10949 | 6.9% |
| N | 10949 | 6.9% |
| F | 9485 | 6.0% |
| b | 9485 | 6.0% |
| n | 8184 | 5.2% |
| A | 7144 | 4.5% |
| Other values (12) | 47508 |
Common
| Value | Count | Frequency (%) |
| - | 105388 | |
| 0 | 79822 | |
| 5 | 32290 | 10.2% |
| 2 | 24110 | 7.6% |
| 7 | 21331 | 6.7% |
| 1 | 16259 | 5.1% |
| 4 | 12517 | 4.0% |
| 3 | 9376 | 3.0% |
| 6 | 8890 | 2.8% |
| 8 | 4896 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 474246 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 105388 | |
| 0 | 79822 | |
| 5 | 32290 | 6.8% |
| 2 | 24110 | 5.1% |
| 7 | 21331 | 4.5% |
| 1 | 16259 | 3.4% |
| J | 15203 | 3.2% |
| u | 14604 | 3.1% |
| e | 13622 | 2.9% |
| 4 | 12517 | 2.6% |
| Other values (23) | 139100 |
Category1
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
| First | 34990 |
|---|---|
| Second | 10559 |
| Third | 7145 |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.2003833 |
| Min length | 5 |
Characters and Unicode
| Total characters | 274029 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | First |
|---|---|
| 2nd row | Second |
| 3rd row | First |
| 4th row | First |
| 5th row | First |
Common Values
| Value | Count | Frequency (%) |
| First | 34990 | 66.4% |
| Second | 10559 | 20.0% |
| Third | 7145 | 13.6% |
Words
| Value | Count | Frequency (%) |
| first | 34990 | 66.4% |
| second | 10559 | 20.0% |
| third | 7145 | 13.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 42135 | |
| r | 42135 | |
| F | 34990 | |
| s | 34990 | |
| t | 34990 | |
| d | 17704 | |
| S | 10559 | 3.9% |
| e | 10559 | 3.9% |
| c | 10559 | 3.9% |
| o | 10559 | 3.9% |
| Other values (3) | 24849 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 221335 | |
| Uppercase Letter | 52694 | 19.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 42135 | |
| r | 42135 | |
| s | 34990 | |
| t | 34990 | |
| d | 17704 | |
| e | 10559 | 4.8% |
| c | 10559 | 4.8% |
| o | 10559 | 4.8% |
| n | 10559 | 4.8% |
| h | 7145 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 34990 | |
| S | 10559 | 20.0% |
| T | 7145 | 13.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 274029 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 42135 | |
| r | 42135 | |
| F | 34990 | |
| s | 34990 | |
| t | 34990 | |
| d | 17704 | |
| S | 10559 | 3.9% |
| e | 10559 | 3.9% |
| c | 10559 | 3.9% |
| o | 10559 | 3.9% |
| Other values (3) | 24849 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 274029 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 42135 | |
| r | 42135 | |
| F | 34990 | |
| s | 34990 | |
| t | 34990 | |
| d | 17704 | |
| S | 10559 | 3.9% |
| e | 10559 | 3.9% |
| c | 10559 | 3.9% |
| o | 10559 | 3.9% |
| Other values (3) | 24849 |
Category2
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
| F | 17316 |
|---|---|
| E | 14684 |
| A | 7687 |
| G | 7145 |
| D | 2872 |
| Other values (2) | 2990 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 52694 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | E |
|---|---|
| 2nd row | A |
| 3rd row | C |
| 4th row | E |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| F | 17316 | 32.9% |
| E | 14684 | 27.9% |
| A | 7687 | 14.6% |
| G | 7145 | 13.6% |
| D | 2872 | 5.5% |
| B | 1718 | 3.3% |
| C | 1272 | 2.4% |
Words
| Value | Count | Frequency (%) |
| f | 17316 | 32.9% |
| e | 14684 | 27.9% |
| a | 7687 | 14.6% |
| g | 7145 | 13.6% |
| d | 2872 | 5.5% |
| b | 1718 | 3.3% |
| c | 1272 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 17316 | |
| E | 14684 | |
| A | 7687 | |
| G | 7145 | |
| D | 2872 | 5.5% |
| B | 1718 | 3.3% |
| C | 1272 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 52694 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 17316 | |
| E | 14684 | |
| A | 7687 | |
| G | 7145 | |
| D | 2872 | 5.5% |
| B | 1718 | 3.3% |
| C | 1272 | 2.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 52694 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 17316 | |
| E | 14684 | |
| A | 7687 | |
| G | 7145 | |
| D | 2872 | 5.5% |
| B | 1718 | 3.3% |
| C | 1272 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 17316 | |
| E | 14684 | |
| A | 7687 | |
| G | 7145 | |
| D | 2872 | 5.5% |
| B | 1718 | 3.3% |
| C | 1272 | 2.4% |
Category3
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
| 2 | 52416 |
|---|---|
| 1 | 278 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 52694 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 52416 | 99.5% |
| 1 | 278 | 0.5% |
Words
| Value | Count | Frequency (%) |
| 2 | 52416 | 99.5% |
| 1 | 278 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 52416 | |
| 1 | 278 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52694 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 52416 | |
| 1 | 278 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 52694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 52416 | |
| 1 | 278 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 52416 | |
| 1 | 278 | 0.5% |
Donation
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 48337 |
| Missing (%) | 91.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.492541 |
| Minimum | 10 |
|---|---|
| Maximum | 280 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 20 |
| median | 30 |
| Q3 | 40 |
| 95-th percentile | 80 |
| Maximum | 280 |
| Range | 270 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 24.132949 |
|---|---|
| Coefficient of variation (CV) | 0.74272274 |
| Kurtosis | 10.114895 |
| Mean | 32.492541 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 2.3891015 |
| Sum | 141570 |
| Variance | 582.39922 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 20 | 1110 | 2.1% |
| 10 | 934 | 1.8% |
| 30 | 885 | 1.7% |
| 40 | 526 | 1.0% |
| 50 | 342 | 0.6% |
| 60 | 184 | 0.3% |
| 70 | 146 | 0.3% |
| 80 | 65 | 0.1% |
| 90 | 49 | 0.1% |
| 100 | 38 | 0.1% |
| Other values (11) | 78 | 0.1% |
| (Missing) | 48337 | 91.7% |
Minimum values
| Value | Count | Frequency (%) |
| 10 | 934 | 1.8% |
| 20 | 1110 | 2.1% |
| 30 | 885 | 1.7% |
| 40 | 526 | 1.0% |
| 50 | 342 | 0.6% |
| 60 | 184 | 0.3% |
| 70 | 146 | 0.3% |
| 80 | 65 | 0.1% |
| 90 | 49 | 0.1% |
| 100 | 38 | 0.1% |
Maximum values
| Value | Count | Frequency (%) |
| 280 | 1 | < 0.1% |
| 250 | 1 | < 0.1% |
| 210 | 2 | < 0.1% |
| 180 | 2 | < 0.1% |
| 170 | 5 | < 0.1% |
| 160 | 5 | < 0.1% |
| 150 | 4 | < 0.1% |
| 140 | 12 | < 0.1% |
| 130 | 12 | < 0.1% |
| 120 | 13 | < 0.1% |
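Several Donation statistics above are derived from others, so they can be cross-checked with a few lines of arithmetic. The sketch below recomputes the coefficient of variation, variance, range, interquartile range and mean from the base figures the report states. The variable names are illustrative, and the tolerances only allow for the rounding of the printed values.

```python
# Base figures copied from the Donation section of this report.
n_observations = 52694
n_missing = 48337
mean = 32.492541
std_dev = 24.132949
total = 141570
minimum, maximum = 10, 280
q1, q3 = 20, 40

# Derived quantities.
n_present = n_observations - n_missing      # rows with a recorded donation
cv = std_dev / mean                          # coefficient of variation
variance = std_dev ** 2
value_range = maximum - minimum
iqr = q3 - q1
mean_from_sum = total / n_present            # should reproduce the reported mean

print(n_present, round(cv, 8), round(variance, 5), value_range, iqr)
```

Only 4357 of the 52694 rows carry a donation, so every Donation statistic describes that 8.3% subset rather than the full dataset.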
Health_Score
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 2841 |
|---|---|
| Distinct (%) | 65.2% |
| Missing | 48337 |
| Missing (%) | 91.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.51643519 |
| Minimum | 0.001666667 |
|---|---|
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 0.001666667 |
|---|---|
| 5-th percentile | 0.057507987 |
| Q1 | 0.25966851 |
| median | 0.52697095 |
| Q3 | 0.77166667 |
| 95-th percentile | 0.95636567 |
| Maximum | 1 |
| Range | 0.99833333 |
| Interquartile range (IQR) | 0.51199816 |
Descriptive statistics
| Standard deviation | 0.28926877 |
|---|---|
| Coefficient of variation (CV) | 0.56012599 |
| Kurtosis | -1.2117525 |
| Mean | 0.51643519 |
| Median Absolute Deviation (MAD) | 0.25048569 |
| Skewness | -0.066340053 |
| Sum | 2250.1081 |
| Variance | 0.083676422 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0.360338573 | 76 | 0.1% |
| 0.46485623 | 71 | 0.1% |
| 0.559854897 | 71 | 0.1% |
| 0.753325272 | 51 | 0.1% |
| 0.752136752 | 43 | 0.1% |
| 0.887545345 | 42 | 0.1% |
| 0.696428571 | 39 | 0.1% |
| 0.436464088 | 39 | 0.1% |
| 0.642172524 | 38 | 0.1% |
| 0.779552716 | 34 | 0.1% |
| Other values (2831) | 3853 | 7.3% |
| (Missing) | 48337 | 91.7% |
Minimum values
| Value | Count | Frequency (%) |
| 0.001666667 | 1 | < 0.1% |
| 0.003333333 | 1 | < 0.1% |
| 0.003846154 | 1 | < 0.1% |
| 0.003937008 | 1 | < 0.1% |
| 0.003968254 | 1 | < 0.1% |
| 0.004149378 | 1 | < 0.1% |
| 0.004926108 | 1 | < 0.1% |
| 0.005 | 1 | < 0.1% |
| 0.007142857 | 2 | < 0.1% |
| 0.007220217 | 1 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
| 1 | 26 | < 0.1% |
| 0.99879081 | 1 | < 0.1% |
| 0.998402556 | 1 | < 0.1% |
| 0.998333333 | 1 | < 0.1% |
| 0.997925311 | 1 | < 0.1% |
| 0.99758162 | 2 | < 0.1% |
| 0.997237569 | 1 | < 0.1% |
| 0.996666667 | 1 | < 0.1% |
| 0.996183206 | 1 | < 0.1% |
| 0.996153846 | 1 | < 0.1% |
Health Score
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 205 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 47214 |
| Missing (%) | 89.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.55453795 |
| Minimum | 0.058992806 |
|---|---|
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 0.058992806 |
|---|---|
| 5-th percentile | 0.11310592 |
| Q1 | 0.38962606 |
| median | 0.51879699 |
| Q3 | 0.76974723 |
| 95-th percentile | 0.9644566 |
| Maximum | 1 |
| Range | 0.94100719 |
| Interquartile range (IQR) | 0.38012118 |
Descriptive statistics
| Standard deviation | 0.25110722 |
|---|---|
| Coefficient of variation (CV) | 0.45282242 |
| Kurtosis | -0.93694133 |
| Mean | 0.55453795 |
| Median Absolute Deviation (MAD) | 0.17239722 |
| Skewness | 0.031960567 |
| Sum | 3038.868 |
| Variance | 0.063054835 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0.402053712 | 724 | 1.4% |
| 0.373205742 | 377 | 0.7% |
| 0.455371248 | 86 | 0.2% |
| 0.505134281 | 83 | 0.2% |
| 0.065741858 | 79 | 0.1% |
| 0.572376357 | 76 | 0.1% |
| 0.69119421 | 70 | 0.1% |
| 0.747285887 | 69 | 0.1% |
| 0.803377563 | 69 | 0.1% |
| 0.507840772 | 69 | 0.1% |
| Other values (195) | 3778 | 7.2% |
| (Missing) | 47214 | 89.6% |
Minimum values
| Value | Count | Frequency (%) |
| 0.058992806 | 27 | 0.1% |
| 0.065741858 | 79 | 0.1% |
| 0.084892086 | 15 | < 0.1% |
| 0.099280576 | 3 | < 0.1% |
| 0.102062975 | 67 | 0.1% |
| 0.103136309 | 41 | 0.1% |
| 0.113105925 | 50 | 0.1% |
| 0.11942446 | 11 | < 0.1% |
| 0.125452352 | 28 | 0.1% |
| 0.127035831 | 18 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
| 1 | 6 | < 0.1% |
| 0.999605055 | 1 | < 0.1% |
| 0.999396864 | 2 | < 0.1% |
| 0.999316473 | 1 | < 0.1% |
| 0.998815166 | 1 | < 0.1% |
| 0.998632946 | 1 | < 0.1% |
| 0.998190591 | 3 | < 0.1% |
| 0.998025276 | 2 | < 0.1% |
| 0.997949419 | 1 | < 0.1% |
| 0.997828447 | 2 | < 0.1% |
Number_of_stall_visited
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 48179 |
| Missing (%) | 91.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9158361 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 12 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.6791891 |
|---|---|
| Coefficient of variation (CV) | 0.57588597 |
| Kurtosis | -1.1108093 |
| Mean | 2.9158361 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.39593218 |
| Sum | 13165 |
| Variance | 2.8196761 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1 | 1257 | 2.4% |
| 2 | 884 | 1.7% |
| 3 | 764 | 1.4% |
| 5 | 718 | 1.4% |
| 4 | 517 | 1.0% |
| 6 | 351 | 0.7% |
| 7 | 12 | < 0.1% |
| 0 | 12 | < 0.1% |
| (Missing) | 48179 | 91.4% |
Minimum values
| Value | Count | Frequency (%) |
| 0 | 12 | < 0.1% |
| 1 | 1257 | 2.4% |
| 2 | 884 | 1.7% |
| 3 | 764 | 1.4% |
| 4 | 517 | 1.0% |
| 5 | 718 | 1.4% |
| 6 | 351 | 0.7% |
| 7 | 12 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
| 7 | 12 | < 0.1% |
| 6 | 351 | 0.7% |
| 5 | 718 | 1.4% |
| 4 | 517 | 1.0% |
| 3 | 764 | 1.4% |
| 2 | 884 | 1.7% |
| 1 | 1257 | 2.4% |
| 0 | 12 | < 0.1% |
Last_Stall_Visited_Number
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 48179 |
| Missing (%) | 91.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.4013289 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 12 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 411.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4635425 |
|---|---|
| Coefficient of variation (CV) | 0.6094719 |
| Kurtosis | -0.34187765 |
| Mean | 2.4013289 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.78695724 |
| Sum | 10842 |
| Variance | 2.1419566 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1728 | 3.3% |
| 2 | 897 | 1.7% |
| 3 | 855 | 1.6% |
| 4 | 534 | 1.0% |
| 5 | 320 | 0.6% |
| 6 | 164 | 0.3% |
| 0 | 12 | < 0.1% |
| 7 | 5 | < 0.1% |
| (Missing) | 48179 | 91.4% |
| Value | Count | Frequency (%) |
| 0 | 12 | < 0.1% |
| 1 | 1728 | 3.3% |
| 2 | 897 | 1.7% |
| 3 | 855 | 1.6% |
| 4 | 534 | 1.0% |
| 5 | 320 | 0.6% |
| 6 | 164 | 0.3% |
| 7 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 5 | < 0.1% |
| 6 | 164 | 0.3% |
| 5 | 320 | 0.6% |
| 4 | 534 | 1.0% |
| 3 | 855 | 1.6% |
| 2 | 897 | 1.7% |
| 1 | 1728 | 3.3% |
| 0 | 12 | < 0.1% |
| | Patient_ID | Health_Camp_ID | Var1 | Var2 | Var5 | Donation | Health_Score | Health Score | Number_of_stall_visited | Last_Stall_Visited_Number | Var3 | Var4 | outcome | Online_Follower | LinkedIn_Shared | Twitter_Shared | Facebook_Shared | Income | Age | City_Type | Employer_Category | Camp_Start_Date | Camp_End_Date | Category1 | Category2 | Category3 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Patient_ID | 1.000 | 0.002 | -0.004 | -0.005 | -0.002 | -0.000 | -0.015 | -0.014 | 0.011 | 0.015 | 0.057 | 0.020 | 0.000 | 0.027 | 0.045 | 0.027 | 0.031 | 0.032 | 0.073 | 0.042 | 0.089 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Health_Camp_ID | 0.002 | 1.000 | 0.049 | 0.030 | 0.046 | 0.025 | -0.003 | -0.022 | -0.058 | -0.023 | 0.000 | 0.020 | 0.283 | 0.059 | 0.067 | 0.054 | 0.059 | 0.077 | 0.065 | 0.041 | 0.002 | 0.944 | 0.903 | 0.521 | 0.530 | 0.489 |
| Var1 | -0.004 | 0.049 | 1.000 | 0.565 | 0.874 | 0.040 | 0.065 | 0.003 | 0.025 | 0.013 | 0.612 | 0.284 | 0.043 | 0.097 | 0.094 | 0.036 | 0.042 | 0.056 | 0.197 | 0.042 | 0.068 | 0.018 | 0.019 | 0.008 | 0.010 | 0.004 |
| Var2 | -0.005 | 0.030 | 0.565 | 1.000 | 0.604 | 0.056 | 0.064 | -0.004 | 0.015 | 0.022 | 0.823 | 0.279 | 0.032 | 0.106 | 0.120 | 0.039 | 0.049 | 0.052 | 0.256 | 0.039 | 0.067 | 0.015 | 0.015 | 0.000 | 0.009 | 0.000 |
| Var5 | -0.002 | 0.046 | 0.874 | 0.604 | 1.000 | 0.039 | 0.056 | 0.003 | 0.018 | 0.008 | 0.782 | 0.298 | 0.055 | 0.118 | 0.140 | 0.105 | 0.098 | 0.071 | 0.226 | 0.037 | 0.058 | 0.030 | 0.032 | 0.006 | 0.024 | 0.027 |
| Donation | -0.000 | 0.025 | 0.040 | 0.056 | 0.039 | 1.000 | 0.421 | NaN | NaN | NaN | 0.000 | 0.000 | 1.000 | 0.067 | 0.061 | 0.055 | 0.010 | 0.047 | 0.104 | 0.011 | 0.000 | 0.051 | 0.052 | 1.000 | 0.101 | 0.014 |
| Health_Score | -0.015 | -0.003 | 0.065 | 0.064 | 0.056 | 0.421 | 1.000 | NaN | NaN | NaN | 0.000 | 0.006 | 1.000 | 0.000 | 0.000 | 0.061 | 0.051 | 0.035 | 0.037 | 0.037 | 0.042 | 0.116 | 0.116 | 1.000 | 0.068 | 0.051 |
| Health Score | -0.014 | -0.022 | 0.003 | -0.004 | 0.003 | NaN | NaN | 1.000 | NaN | NaN | 0.031 | 0.029 | 1.000 | 0.051 | 0.076 | 0.068 | 0.052 | 0.031 | 0.053 | 0.002 | 0.000 | 0.252 | 0.252 | 1.000 | 0.093 | 1.000 |
| Number_of_stall_visited | 0.011 | -0.058 | 0.025 | 0.015 | 0.018 | NaN | NaN | NaN | 1.000 | 0.572 | 0.025 | 0.000 | 0.999 | 0.028 | 0.026 | 0.051 | 0.032 | 0.078 | 0.040 | 0.022 | 0.089 | 0.084 | 0.084 | 1.000 | 1.000 | 1.000 |
| Last_Stall_Visited_Number | 0.015 | -0.023 | 0.013 | 0.022 | 0.008 | NaN | NaN | NaN | 0.572 | 1.000 | 0.022 | 0.223 | 0.999 | 0.034 | 0.000 | 0.066 | 0.000 | 0.080 | 0.000 | 0.034 | 0.035 | 0.037 | 0.037 | 1.000 | 1.000 | 1.000 |
| Var3 | 0.057 | 0.000 | 0.612 | 0.823 | 0.782 | 0.000 | 0.000 | 0.031 | 0.025 | 0.022 | 1.000 | 0.207 | 0.011 | 0.003 | 0.006 | 0.003 | 0.003 | 0.054 | 0.767 | 0.055 | 0.079 | 0.000 | 0.000 | 0.000 | 0.005 | 0.000 |
| Var4 | 0.020 | 0.020 | 0.284 | 0.279 | 0.298 | 0.000 | 0.006 | 0.029 | 0.000 | 0.223 | 0.207 | 1.000 | 0.037 | 0.053 | 0.065 | 0.050 | 0.052 | 0.056 | 0.114 | 0.027 | 0.055 | 0.024 | 0.024 | 0.015 | 0.020 | 0.006 |
| outcome | 0.000 | 0.283 | 0.043 | 0.032 | 0.055 | 1.000 | 1.000 | 1.000 | 0.999 | 0.999 | 0.011 | 0.037 | 1.000 | 0.050 | 0.057 | 0.045 | 0.040 | 0.087 | 0.099 | 0.020 | 0.041 | 0.518 | 0.417 | 0.472 | 0.481 | 0.000 |
| Online_Follower | 0.027 | 0.059 | 0.097 | 0.106 | 0.118 | 0.067 | 0.000 | 0.051 | 0.028 | 0.034 | 0.003 | 0.053 | 0.050 | 1.000 | 0.476 | 0.606 | 0.458 | 0.348 | 0.390 | 0.032 | 0.123 | 0.078 | 0.079 | 0.022 | 0.049 | 0.011 |
| LinkedIn_Shared | 0.045 | 0.067 | 0.094 | 0.120 | 0.140 | 0.061 | 0.000 | 0.076 | 0.026 | 0.000 | 0.006 | 0.065 | 0.057 | 0.476 | 1.000 | 0.382 | 0.473 | 0.385 | 0.418 | 0.027 | 0.130 | 0.090 | 0.093 | 0.022 | 0.055 | 0.012 |
| Twitter_Shared | 0.027 | 0.054 | 0.036 | 0.039 | 0.105 | 0.055 | 0.061 | 0.068 | 0.051 | 0.066 | 0.003 | 0.050 | 0.045 | 0.606 | 0.382 | 1.000 | 0.516 | 0.339 | 0.383 | 0.047 | 0.117 | 0.069 | 0.072 | 0.019 | 0.042 | 0.004 |
| Facebook_Shared | 0.031 | 0.059 | 0.042 | 0.049 | 0.098 | 0.010 | 0.051 | 0.052 | 0.032 | 0.000 | 0.003 | 0.052 | 0.040 | 0.458 | 0.473 | 0.516 | 1.000 | 0.350 | 0.398 | 0.031 | 0.087 | 0.077 | 0.078 | 0.023 | 0.049 | 0.008 |
| Income | 0.032 | 0.077 | 0.056 | 0.052 | 0.071 | 0.047 | 0.035 | 0.031 | 0.078 | 0.080 | 0.054 | 0.056 | 0.087 | 0.348 | 0.385 | 0.339 | 0.350 | 1.000 | 0.409 | 0.074 | 0.103 | 0.099 | 0.101 | 0.037 | 0.084 | 0.023 |
| Age | 0.073 | 0.065 | 0.197 | 0.256 | 0.226 | 0.104 | 0.037 | 0.053 | 0.040 | 0.000 | 0.767 | 0.114 | 0.099 | 0.390 | 0.418 | 0.383 | 0.398 | 0.409 | 1.000 | 0.109 | 0.172 | 0.040 | 0.042 | 0.027 | 0.080 | 0.035 |
| City_Type | 0.042 | 0.041 | 0.042 | 0.039 | 0.037 | 0.011 | 0.037 | 0.002 | 0.022 | 0.034 | 0.055 | 0.027 | 0.020 | 0.032 | 0.027 | 0.047 | 0.031 | 0.074 | 0.109 | 1.000 | 0.102 | 0.076 | 0.074 | 0.017 | 0.043 | 0.000 |
| Employer_Category | 0.089 | 0.002 | 0.068 | 0.067 | 0.058 | 0.000 | 0.042 | 0.000 | 0.089 | 0.035 | 0.079 | 0.055 | 0.041 | 0.123 | 0.130 | 0.117 | 0.087 | 0.103 | 0.172 | 0.102 | 1.000 | 0.012 | 0.011 | 0.034 | 0.016 | 0.022 |
| Camp_Start_Date | 0.000 | 0.944 | 0.018 | 0.015 | 0.030 | 0.051 | 0.116 | 0.252 | 0.084 | 0.037 | 0.000 | 0.024 | 0.518 | 0.078 | 0.090 | 0.069 | 0.077 | 0.099 | 0.040 | 0.076 | 0.012 | 1.000 | 0.973 | 1.000 | 0.990 | 1.000 |
| Camp_End_Date | 0.000 | 0.903 | 0.019 | 0.015 | 0.032 | 0.052 | 0.116 | 0.252 | 0.084 | 0.037 | 0.000 | 0.024 | 0.417 | 0.079 | 0.093 | 0.072 | 0.078 | 0.101 | 0.042 | 0.074 | 0.011 | 0.973 | 1.000 | 0.875 | 0.941 | 1.000 |
| Category1 | 0.000 | 0.521 | 0.008 | 0.000 | 0.006 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.015 | 0.472 | 0.022 | 0.022 | 0.019 | 0.023 | 0.037 | 0.027 | 0.017 | 0.034 | 1.000 | 0.875 | 1.000 | 1.000 | 0.051 |
| Category2 | 0.000 | 0.530 | 0.010 | 0.009 | 0.024 | 0.101 | 0.068 | 0.093 | 1.000 | 1.000 | 0.005 | 0.020 | 0.481 | 0.049 | 0.055 | 0.042 | 0.049 | 0.084 | 0.080 | 0.043 | 0.016 | 0.990 | 0.941 | 1.000 | 1.000 | 0.069 |
| Category3 | 0.000 | 0.489 | 0.004 | 0.000 | 0.027 | 0.014 | 0.051 | 1.000 | 1.000 | 1.000 | 0.000 | 0.006 | 0.000 | 0.011 | 0.012 | 0.004 | 0.008 | 0.023 | 0.035 | 0.000 | 0.022 | 1.000 | 1.000 | 0.051 | 0.069 | 1.000 |
| | Patient_ID | Health_Camp_ID | Registration_Date | Var1 | Var2 | Var3 | Var4 | Var5 | outcome | Online_Follower | LinkedIn_Shared | Twitter_Shared | Facebook_Shared | Income | Education_Score | Age | First_Interaction | City_Type | Employer_Category | Camp_Start_Date | Camp_End_Date | Category1 | Category2 | Category3 | Donation | Health_Score | Health Score | Number_of_stall_visited | Last_Stall_Visited_Number |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 526927 | 6570 | 14/05/05 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 14-Nov-04 | NaN | NaN | 09-Jul-05 | 22-Jul-05 | First | E | 2 | NaN | NaN | NaN | NaN | NaN |
| 1 | 510379 | 6534 | 26/05/06 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | None | None | None | 26-May-06 | H | NaN | 17-Oct-05 | 07-Nov-07 | Second | A | 2 | NaN | NaN | 0.402054 | NaN | NaN |
| 2 | 520968 | 6557 | 07/01/04 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | None | None | None | 07-Jan-04 | H | NaN | 04-Jan-04 | 09-Jan-04 | First | C | 2 | 20.0 | 0.611111 | NaN | NaN | NaN |
| 3 | 507625 | 6535 | 12/02/04 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 12-Feb-04 | B | NaN | 01-Feb-04 | 18-Feb-04 | First | E | 2 | NaN | NaN | NaN | NaN | NaN |
| 4 | 502611 | 6581 | 14/03/04 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 14-Mar-04 | B | NaN | 07-Dec-03 | 13-Jun-04 | First | F | 2 | NaN | NaN | NaN | NaN | NaN |
| 5 | 487442 | 6526 | 07/01/05 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 30-Nov-04 | NaN | NaN | 03-Jan-05 | 20-Feb-05 | First | E | 2 | NaN | NaN | NaN | NaN | NaN |
| 6 | 510807 | 6543 | 29/12/06 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 29-Dec-06 | NaN | NaN | 27-Sep-05 | 07-Nov-07 | First | F | 2 | NaN | NaN | NaN | NaN | NaN |
| 7 | 502795 | 6539 | 22/09/04 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 21-Sep-04 | NaN | NaN | 07-Aug-04 | 12-Feb-05 | First | F | 2 | NaN | NaN | NaN | NaN | NaN |
| 8 | 489318 | 6532 | 08/04/05 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 24-Jan-05 | NaN | NaN | 19-Feb-05 | 23-Aug-05 | First | F | 2 | NaN | NaN | NaN | NaN | NaN |
| 9 | 507386 | 6543 | 07/11/06 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 31-Oct-06 | NaN | NaN | 27-Sep-05 | 07-Nov-07 | First | F | 2 | NaN | NaN | NaN | NaN | NaN |
| | Patient_ID | Health_Camp_ID | Registration_Date | Var1 | Var2 | Var3 | Var4 | Var5 | outcome | Online_Follower | LinkedIn_Shared | Twitter_Shared | Facebook_Shared | Income | Education_Score | Age | First_Interaction | City_Type | Employer_Category | Camp_Start_Date | Camp_End_Date | Category1 | Category2 | Category3 | Donation | Health_Score | Health Score | Number_of_stall_visited | Last_Stall_Visited_Number |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 52684 | 516363 | 6532 | 03/03/05 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 53 | 55 | 11-Dec-04 | C | NaN | 19-Feb-05 | 23-Aug-05 | First | F | 2 | NaN | NaN | NaN | NaN | NaN |
| 52685 | 511809 | 6526 | 05/11/04 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | 72 | 03-Oct-04 | C | Technology | 03-Jan-05 | 20-Feb-05 | First | E | 2 | NaN | NaN | NaN | NaN | NaN |
| 52686 | 494756 | 6542 | 07/04/05 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 26-Mar-05 | NaN | NaN | 19-Feb-05 | 23-Aug-05 | First | F | 2 | NaN | NaN | NaN | NaN | NaN |
| 52687 | 519167 | 6543 | 19/07/06 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 12-Feb-06 | B | NaN | 27-Sep-05 | 07-Nov-07 | First | F | 2 | NaN | NaN | NaN | NaN | NaN |
| 52688 | 495320 | 6570 | 14/05/05 | 3 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | None | None | 43 | 08-Feb-03 | B | NaN | 09-Jul-05 | 22-Jul-05 | First | E | 2 | NaN | NaN | NaN | NaN | NaN |
| 52689 | 528062 | 6529 | 03/03/06 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | None | None | None | 22-Apr-05 | H | NaN | 30-Mar-06 | 03-Apr-06 | Second | A | 2 | NaN | NaN | 0.161641 | NaN | NaN |
| 52690 | 513331 | 6546 | 10/01/04 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 10-Jan-04 | NaN | NaN | 09-Jan-04 | 17-Jan-04 | First | E | 2 | NaN | NaN | NaN | NaN | NaN |
| 52691 | 486727 | 6523 | 30/04/05 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | None | 43 | 08-Feb-03 | I | NaN | 23-Feb-05 | 16-Sep-05 | Second | D | 2 | NaN | NaN | 0.518797 | NaN | NaN |
| 52692 | 515312 | 6523 | 06/09/05 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 77 | 46 | 01-May-05 | A | NaN | 23-Feb-05 | 16-Sep-05 | Second | D | 2 | NaN | NaN | 0.964457 | NaN | NaN |
| 52693 | 488938 | 6562 | 11/02/05 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | None | None | None | 31-Jan-05 | D | NaN | 24-Nov-04 | 02-Jun-05 | First | F | 2 | NaN | NaN | NaN | NaN | NaN |
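The sample rows show two different date layouts: Registration_Date looks like 14/05/05, while First_Interaction, Camp_Start_Date and Camp_End_Date look like 09-Jul-05. Stored as strings, such columns are treated as categorical, which would explain the high-cardinality alerts on the date fields. The following is a minimal sketch of parsing both layouts with explicit formats. The format strings are assumptions inferred from the sample rows only (day first, two-digit year, English month abbreviations), and `parse_date` is a hypothetical helper, not part of the report. Note that `%b` depends on the active locale.

```python
from datetime import date, datetime

# Assumed formats, inferred from the sample rows only.
REGISTRATION_FMT = "%d/%m/%y"  # e.g. 14/05/05; day first, since 14 cannot be a month
CAMP_FMT = "%d-%b-%y"          # e.g. 09-Jul-05; month abbreviation is locale-dependent


def parse_date(text: str, fmt: str) -> date:
    """Parse one date string with an explicit format; raises ValueError on a mismatch."""
    return datetime.strptime(text, fmt).date()


# Values copied from sample row 0.
registered = parse_date("14/05/05", REGISTRATION_FMT)
camp_start = parse_date("09-Jul-05", CAMP_FMT)
camp_end = parse_date("22-Jul-05", CAMP_FMT)

# Once parsed, the dates support arithmetic that the raw strings do not.
camp_length_days = (camp_end - camp_start).days
days_before_camp = (camp_start - registered).days

print(registered, camp_start, camp_end)
print(camp_length_days, days_before_camp)
```

Passing an explicit format avoids silent month/day swaps on ambiguous values such as 07/01/04, which a format-guessing parser could read either way.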